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APPLICANT: Petrini, John H.J. 
Morgan, William Franklin 
Maser, Richard Scott 
Carney, James Patrick 

TITLE OF INVENTION: DNA Encoding A DNA Repair Protein 
FILE REFERENCE: 800.019US1 
CURRENT APPLICATION NUMBER: 10/044,447 
CURRENT FILING DATE: 2002-01-10 
PRIOR APPLICATION NUMBER: US/09/067,641 
PRIOR FILING DATE: 1998-04-27 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO: 1 
LENGTH : 4403 
TYPE: DNA 

ORGANISM: Homo sapiens 
SEQUENCE: 1 

ttcggcacga ggcgcggttg cacgtcggcc ccagccctga ggagccggac cgatgtggaa 60 
actgctgccc gccgcgggcc cggcaggagg agaaccatac agacttttga ctggcgttga 120 
gtacgttgtt ggaaggaaaa actgtgccat tctaattgaa aatgatcagt cgatcagccg 180 
aaatcatgct gtgttaactg ctaacttttc tgtaaccaac ctgagtcaaa cagatgaaat 240 
ccctgtattg acattaaaag ataattctaa gtatggtacc tttgttaatg aggaaaaaat 300 
gcagaatggc ttttcccgaa ctttgaagtc gggggatggt attacttttg gagtgtttgg 360 
aagtaaattc agaatagagt atgagccttt ggttgcatgc tcttcttgtt tagatgtctc 420 
tgggaaaact gctttaaatc aagctatatt gcaacttgga ggatttactg taaacaattg 480 
gacagaagaa tgcactcacc ttgtcatggt atcagtgaaa gttaccatta aaacaatatg 540 
tgcactcatt tgtggacgtc caattgtaaa gccagaatat tttactgaat tcctgaaagc 600 
agttcagtcc aagaagcagc ctccacaaat tgaaagtttt tacccacctc ttgatgaacc 660 
atctattgga agtaaaaatg ttgatctgtc aggacggcag gaaagaaaac aaatcttcaa 720 
agggaaaaca tttatatttt tgaatgccaa acagcataag aaattgagtt ccgcagttgt 780 
ctttggaggt ggggaagcta ggttgataac agaagagaat gaagaagaac ataatttctt 840 
tttggctccg ggaacgtgtg ttgttgatac aggaataaca aactcacaga ccttaattcc 900 
tgactgtcag aagaaatgga ttcagtcaat aatggatatg ctccaaaggc aaggtcttag 960 
acctattcct gaagcagaaa ttggattggc ggtgattttc atgactacaa agaattactg 1020 
tgatcctcag ggccatccca gtacaggatt aaagacaaca actccaggac caagcctttc 1080 
acaaggcgtg tcagttgatg aaaaactaat gccaagcgcc ccagtgaaca ctacaacata 1140 
cgtagctgac acagaatcag agcaagcaga tacatgggat ttgagtgaaa ggccaaaaga 1200 
aatcaaagtc tccaaaatgg aacaaaaatt cagaatgctt tcacaagacg cacccactgt 1260 
aaaggagtcc tgcaaaacaa gctctaataa taatagtatg gtatcaaata ctttggctaa 1320 
gatgagaatc ccaaactatc agctttcacc aactaaattg ccaagtataa ataaaagtaa 1380 
agatagggct tctcagcagc agcagaccaa ctccatcaga aactactttc agccgtctac 1440 
caaaaaaagg gaaagggatg aagaaaatca agaaatgtct tcatgcaaat cagcaagaat 1500 
agaaacgtct tgttctcttt tagaacaaac acaacctgct acaccctcat tgtggaaaaa 1560 
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48 taaggagcag catctatctg agaatgagcc tgtggacaca aactcagaca ataacttatt 1620 

49 tacagataca gatttaaaat ctattgtgaa aaattctgcc agtaaatctc atgctgcaga 1680 

50 aaagctaaga tcaaataaaa aaagggaaat ggatgatgtg gccatagaag atgaagtatt 1740 

51 ggaacagtta ttcaaggaca caaaaccaga gttagaaatt gatgtgaaag ttcaaaaaca 1800 

52 ggaggaagat gtcaatgtta gaaaaaggcc aaggatggat atagaaacaa atgacacttt 1860 

53 cagtgatgaa gcagtaccag aaagtagcaa aatatctcaa gaaaatgaaa ttgggaagaa 1920 

54 acgtgaactc aaggaagact cactatggtc agctaaagaa atatctaaca atgacaaact 1980 

55 tcaggatgat agtgagatgc ttccaaaaaa gctgttattg actgaattta gatcactggt 2040 

56 gattaaaaac tctacttcca gaaatccgtc tggcataaat gatgattatg gtcaactaaa 2100 

57 aaatttcaag aaattcaaaa aggtcacata tcctggagca ggaaaacttc cacacatcat 2160 

58 tggaggatca gatctaatag ctcatcatgc tcgaaagaat acagaactag aagagtggct 2220 

59 aaggcaggaa atggaggtac aaaatcaaca tgcaaaagaa gagtctcttg ctgatgatct 2280 

60 ttttagatac aatccttatt taaaaaggag aagataactg aggattttaa aaagaagcca 2340 

61 tggaaaaact tcctagtaag catctacttc aggccaacaa ggttatatga atatatagtg 2400 

62 tatagaagcg atttaagtta caatgtttta tggcctaaat ttattaaata aaatgcacaa 24 60 
6 3 aactttgatt cttttgtatg taacaattgt ttgtyctgtt ttcaggcttt gtcattgcat 2520 

64 ctttttttca tttttaaatg tgttttgttt attaaatagt taatatagtc acagttcaaa 2580 

65 attctaaatr tacgtaaggt aaaggactaa agtcaccctt ccaccattgt cctagctact 2640 

66 tatttttaaa taatttccta cacaaatgat agcataacat atgcagtgtt ctacaccttg 2700 

67 cttttttact tagtaagatt aaaaattata ggaatatcaa tataatgttt ttaatatttt 2760 

68 ttcttttcca ttatgctgta gtcttaccta aactctggtg atccaaacaa aatggcttca 2820 

69 gtggtgcaga tgtcacctac atgttattct agtactagaa actgaagacc atgtggagac 2880 

70 ttcatcaaac atgggtttag ttttcaccag aatggaaaga cctgtacccc tttttggtgg 2940 

71 tcttactgag ctgggtgggt gtctgttttg agcttattta gagtcctagt tttcctactt 3000 

72 ataaagtaga aatggtgaga ttgttttctt tttctacckt aaagggagat ggtaagaaac 3060 

73 aatgaatgtc ttttttcaaa ctttattgac aagtgatttt caagtctgtg ttcaaaaata 3120 

74 tattcatgta cctgtgatcc agcaagaagg gagttccagt caagagtcac tacaactgat 3180 

75 tagttgttta gagaatgaga aatggaacag tgaggaatgg aggccatatt tccatgactt 3240 

76 cccttgtaaa cagaagcaac agaagggaca agaggctggc ctctacatca ctctcacctt 3300 

77 ccaaatcttg tggaagtgca tctacttgcc agaaccaaat taacttactt ccaagttctg 3360 

78 gctgcttgca ggtggaactc cagctgcaag ggagttaggg aaatgaaggt ctttttttaa 3420 

79 aagcttctca gccttcctag ggaacagaaa ttgggtgagc caatctgcaa tttctactac 3480 

80 aggcattgag accagttaga ttattgaaat attatagaga gttatgaaca cttaaattat 3540 

81 gatagtggta tgacattgga tagaacatgg gatactttag aagtagaatt gacagggcat 3600 

82 attagttgat gaaatggagt catttgagtc tyttaatagc catgtatcat aattaccaag 3660 

83 tgaagctggt ggaacatatg gtctccattt tacagttaag gaatataatg gacagattaa 3720 

84 tattgttytc tgtcatgccc acaatccctt tctaaggaag actgccctac tatagcagtt 3780 

85 tttatatttg tcaatttatg aatataatga atgaggagtt ctggtacctc ctgtctttac 3840 

86 aaatattggg tgttgtccag tatttttccc tttttaaccm ttcccaattc gggtgtgtag 39O0 

87 gtggatgttt ccatttgggt tttaatttgt atatccctga tagctataat tgggtcatag 3960 

88 aaattcttta tacattctag atgcaagtct cttgycggat atacgtattg agatattaca 4020 

89 cctagtctgt ggcttgactg ttttctttat gtcttttgat gaatagaagt tttaaatttt 4080 

90 gacaaggtca aatttatttt tttcttttgt ttgatatttt ttctctccaa tttaacccca 4140 

91 agatttcaga tattctgctc tattatataa actttatatt tttatatttg tgatctacct 4200 

92 tgaattgata tgtatgttgt gaattatgga tcagggttct ttttttcccc catacaagta 4260 

93 tccagtcatt gtaacactgt ttattgaaag aattatcctt tcctcattaa attaccttgc 4320 

94 caattagtaa aaaatcaatt aaccatrmar mmmrrrggat ccactagttc tagagcggcc 4380 

95 gccaccgcgg tggagctcca get 4403 



97 <210> SEQ ID NO: 2 
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98 <:211> LENGTH: 754 

99 <:212> TYPE: PRT 

100 <:213> ORGANISM: Homo sapiens 

101 <:400> SEQUENCE: 2 



H)2 Met Trp Lys Leu Leu Pro Ala Ala Gly Pro Ala Gly Gly Glu Pro Tyr 

10* 1 5 10 15 

104 Arg Leu Leu Thr Gly Val Glu Tyr Val Val Gly Arg Lys Asn Cys Ala 

10 5 20 2 5 3 0 

106 lie Leu lie Glu Asn Asp Gin Ser lie Ser Arg Asn His Ala Val Leu 

10 7 3 5 4 0 45 

108 Thr Ala Asn Phe Ser Val Thr Asn Leu Ser Gin Thr Asp Glu He Pro 

109 50 55 60 

1L0 Val Leu Thr Leu Lys Asp Asn Ser Lys Tyr Gly Thr Phe Val Asn Glu 

111 65 70 75 80 

112 Glu Lys Met Gin Asn Gly Phe Ser Arg Thr Leu Lys Ser Gly Asp Gly 

113 85 90 95 

114 He Thr Phe Gly Val Phe Gly Ser Lys Phe Arg He Glu Tyr Glu Pro 

115 100 105 110 

lib Leu Val Ala Cys Ser Ser Cys Leu Asp Val Ser Gly Lys Thr Ala Leu 

117 115 120 125 

118 Asn Gin Ala lie Leu Gin Leu Gly Gly Phe Thr Val Asn Asn Trp Thr 

119 130 135 140 

120 Glu Glu Cys Thr His Leu Val Met Val Ser Val Lys Val Thr He Lys 

121 145 150 155 160 

122 Thr He Cys Ala Leu He Cys Gly Arg Pro He Val Lys Pro Glu Tyr 

123 165 170 175 

124 Phe Thr Glu Phe Leu Lys Ala Val Gin Ser Lys Lys Gin Pro Pro Gin 

125 180 185 190 

126 He Glu Ser Phe Tyr Pro Pro Leu Asp Glu Pro Ser He Gly Ser Lys 

127 195 200 205 

128 Asn Val Asp Leu Ser Gly Arg Gin Glu Arg Lys Gin He Phe Lys Gly 

129 210 215 220 

130 Lys Thr Phe He Phe Leu Asn Ala Lys Gin His Lys Lys Leu Ser Ser 

131 225 230 235 240 

132 Ala Val Val Phe Gly Gly Gly Glu Ala Arg Leu He Thr Glu Glu Asn 

133 245 250 255 

134 Glu Glu Glu His Asn Phe Phe Leu Ala Pro Gly Thr Cys Val Val Asp 

135 260 265 270 

136 Thr Gly He Thr Asn Ser Gin Thr Leu He Pro Asp Cys Gin Lys Lys 

137 275 280 285 

138 Trp He Gin Ser He Met Asp Met Leu Gin Arg Gin Gly Leu Arg Pro 

139 290 295 300 

140 He Pro Glu Ala Glu He Gly Leu Ala Val He Phe Met Thr Thr Lys 

141 305 310 315 320 

142 Asn Tyr Cys Asp Pro Gin Gly His Pro Ser Thr Gly Leu Lys Thr Thr 

143 325 330 335 

144 Thr Pro Gly Pro Ser Leu Ser Gin Gly Val Ser Val Asp Glu Lys Leu 

145 340 345 350 

146 Met Pro Ser Ala Pro Val Asn Thr Thr Thr Tyr Val Ala Asp Thr Glu 
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196 Arg Arg 

198 <210:> SEQ ID NO: 3 

199 <211> LENGTH: 87 

200 <212> TYPE: PRT 

201 <213> ORGANISM: Homo sapiens 

202 <220> FEATURE: 

20 3 <221> NAME/KEY: UNSURE 

204 <222> LOCATION: (48)... (48 ) 

205 <223> OTHER INFORMATION: Unsure 

206 <400> SEQUENCE: 3 



207 Tyr Val Val Gly Arg Lys Asn Cys Ala lie Leu lie Glu Asn Asp Gin 

208 15 10 15 

209 Ser lie Ser Arg Asn His Ala Val Leu Thr Ala Asn Phe Ser Val Thr 

210 20 25 30 

W--> 211 Asn Leu Ser Gin Thr Asp Glu lie Pro Val Leu Thr Leu Lys Asn Xaa 

212 35 40 45 

213 Lys Tyr Gly Thr Phe Val Asn Glu Glu Lys Met Gin Asn Gly Phe Ser 

214 50 55 60 

215 Arg Thr Leu Lys Ser Val Asp Gly lie Thr Phe Gly Val Phe Gly Ser 

216 65 70 ' ^ 75 80 

217 Lys Phe Arg lie Glu Tyr Glu 

218 85 



220 <210> SEQ ID NO: 4 

221 <211> LENGTH: 87 

222 <212> TYPE: PRT 

223 <213> ORGANISM: Homo sapiens 

224 <400> SEQUENCE: 4 



225 Tyr Ser lie Gly Arg Ser Ser Lys Asn Pro Leu lie lie Lys Asn Asp 

226 15 10 15 

227 Lys Ser lie Ser Arg Gin His lie Thr Phe Lys Trp Glu lie Asn Asn 

228 20 25 30 

229 Ser Ser Asp Leu Lys His Ser Ser Leu Cys Leu Val Asn Lys Gly Lys 

230 35 40 45 

231 Leu Thr Ser Leu Asn Lys Lys Phe Met Lys Val Gly Glu Thr Phe Thr 

232 50 55 60 

233 He Asn Ala Ser Cys Val Leu Lys Ser Thr He Glu Leu Gly Thr Thr 

234 65 70 75 80 

235 Pro He Arg He Glu Phe Glu 

236 85 



238 <210> SEQ ID NO: 5 

239 <211> LENGTH: 13 

240 <.212> TYPE: PRT 

241 <213> ORGANISM: Homo sapiens 

242 <400> SEQUENCE: 5 

243 Asn Pro Ser Gly Leu Asn Asp Asp Tyr Gly Gin Leu Lys 

244 15 10 

246 <210> SEQ ID NO: 6 

247 <211> LENGTH: 680 

248 <212> TYPE: PRT 
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VERIFICATION SUMMARY DATE : 08/14/2002 

PATENT APPLICATION: US/10/044,447 TIME: 15:17:45 

Input Set : N:\Crf3\RULE60\10044447.raw 
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L:211 M:341 W: (46) M n " or "Xaa" used, for SEQ IDtt : 3 
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